Adaptive critic based approximate dynamic programming: A new tool for smart manufacturing
نویسندگان
چکیده
This work supported in part by the National Science Foundation under grant ECS-9904378. Abstract Adaptive critic based approximate dynamic programming techniques are gradient based methods for finding optimal policies for multi-stage decision processes. We believe adaptive critic methods are now developed to the point that they can be applied to the full spectrum of decision and control problems, including inventory control and job scheduling problems of interest in manufacturing. In this paper we illustrate the use one such technique for the development of a policy manager for a distribution system. We compare the performance of both neural net based and fuzzy rule based policy managers with that of LP and GA based fixed policies. Both adaptive critic based soft computing techniques outperform the fixed policies for stationary and nonstationary stochastic demand conditions.
منابع مشابه
A New Hybrid Critic-training Method for Approximate Dynamic Programming
A variety of methods for developing quasi-optimal intelligent control systems using reinforcement learning techniques based on adaptive critics have appeared in recent years. This paper reviews the family of approximate dynamic programming techniques based on adaptive critic methods and introduces a new hybrid critic training method.
متن کاملAn Introduction to Adaptive Critic Control: A Paradigm Based on Approximate Dynamic Programming
Adaptive critic control is an advanced control technology developed for nonlinear dynamical systems in recent years. It is based on the idea of approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950’s for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, applications of dynamic programming have been lim...
متن کاملImproving quasi-optimal inventory and transportation policies using adaptive critic based approximate dynamic programming
We demonstrate the possibility of optimal control of physical inventory systems in a nonstationary fitness terrain, based on the combined application of evolutionary search and adaptive critic terrain following. We show that adaptive critic based approximate dynamic programming techniques based on plant-controller Jacobeans can be used with systems characterized by discrete valued states and co...
متن کاملPartial, noisy and qualitative models for adaptive critic based neurocontrol
The roles of plant models in adaptive critic methods for approximate dynamic programming are considered, with primary focus given to the DHP methodology. In place of complete system identification, partial, approximate, and qualitative models of plant dynamics are considered. Such models are found to be sufficient for successful controller design. As classification is in general easier than reg...
متن کاملMidcourse guidance law with neural networks
A dual neural network ‘adaptive critic approach’ is used in this study to generate midcourse guidance commands for a missile to reach a predicted impact point while maximizing its final velocity. The adaptive critic approach is based on approximate dynamic programming. The first network, called a ‘critic’, network, outputs the Lagrangian multipliers arising in an optimal control formulation whi...
متن کامل